Gene Characterization Index: Assessing the Depth of Gene Annotation

نویسندگان

  • Danielle Kemmer
  • Raf M. Podowski
  • Dimas Yusuf
  • Jochen Brumm
  • Warren Cheung
  • Claes Wahlestedt
  • Boris Lenhard
  • Wyeth W. Wasserman
چکیده

BACKGROUND We introduce the Gene Characterization Index, a bioinformatics method for scoring the extent to which a protein-encoding gene is functionally described. Inherently a reflection of human perception, the Gene Characterization Index is applied for assessing the characterization status of individual genes, thus serving the advancement of both genome annotation and applied genomics research by rapid and unbiased identification of groups of uncharacterized genes for diverse applications such as directed functional studies and delineation of novel drug targets. METHODOLOGY/PRINCIPAL FINDINGS The scoring procedure is based on a global survey of researchers, who assigned characterization scores from 1 (poor) to 10 (extensive) for a sample of genes based on major online resources. By evaluating the survey as training data, we developed a bioinformatics procedure to assign gene characterization scores to all genes in the human genome. We analyzed snapshots of functional genome annotation over a period of 6 years to assess temporal changes reflected by the increase of the average Gene Characterization Index. Applying the Gene Characterization Index to genes within pharmaceutically relevant classes, we confirmed known drug targets as high-scoring genes and revealed potentially interesting novel targets with low characterization indexes. Removing known drug targets and genes linked to sequence-related patent filings from the entirety of indexed genes, we identified sets of low-scoring genes particularly suited for further experimental investigation. CONCLUSIONS/SIGNIFICANCE The Gene Characterization Index is intended to serve as a tool to the scientific community and granting agencies for focusing resources and efforts on unexplored areas of the genome. The Gene Characterization Index is available from http://cisreg.ca/gci/.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Identification of Prognostic Genes in Her2-enriched Breast Cancer by Gene Co-Expression Net-work Analysis

Introduction: HER2-enriched subtype of breast cancer has a worse prognosis than luminal subtypes. Recently, the discovery of targeted therapies in other groups of breast cancer has increased patient survival. The aim of this study was to identify genes that affect the overall survival of this group of patients based on a systems biology approach. Methods: Gene expression data and clinical infor...

متن کامل

Clustering of a Number of Genes Affecting in Milk Production using Information Theory and Mutual Information

Information theory is a branch of mathematics. Information theory is used in genetic and bioinformatics analyses and can be used for many analyses related to the biological structures and sequences. Bio-computational grouping of genes facilitates genetic analysis, sequencing and structural-based analyses. In this study, after retrieving gene and exon DNA sequences affecting milk yield in dairy ...

متن کامل

Molecular characterization of the lipL41 gene of Leptospira interrogans vaccinal serovars in Iran

Leptospirosis caused by infection with pathogenic leptospires, which is the most prevalent zoonotic disease in the world. The outer membrane proteins (OMPs) of pathogenic leptospires such as LipL41 play a crucial role in pathogenesis of this disease. Therefore a major challenge to develop an effective vaccine against leptospirosis is application of basic research on the OMPs of leptospires to i...

متن کامل

Characterization of Iranian Avian Metapneumovirus based on Fusion Gene (F)

  Avian metapneumovirus (aMPV) represents one of the most prevalent diseases of poultry mainly in combination with other pathogens, and it is increasing among chickens.  In the present study, the detection and characterization of an aMPV subtype B strain circulating in broiler flocks based on fusion (F) gene. In phylogenetic analysis, the isolates are located in B subtype cl...

متن کامل

PURIFICATION AND CHARACTERIZATION OF THE CLONED HUMAN GM-CSF GENE EXPRESSED IN ESCHERICHIA COLI

The human granulocyte-macrophage colony stimulation factor (hGM-CSF) gene was cloned in the pET 23a( +) expression vector under the control of strong bacteriophage T7 transcription and translation signals. The hGM-CSF gene was transferred into E. coli strainBL21 (DE3)pLysS andIPTG was used for induction of GM-CSF gene. Production of the target protein was obtained as revealed by ELISA and ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • PLoS ONE

دوره 3  شماره 

صفحات  -

تاریخ انتشار 2008